On the Energy Proportionality of Distributed NoSQL Data Stores
نویسندگان
چکیده
The computing community is facing several big data challenges due to the unprecedented growth in the volume and variety of data. Many large-scale Internet companies use distributed NoSQL data stores to mitigate these challenges. These NoSQL data-store installations require massive computing infrastructure, which consume significant amount of energy and contribute to operational costs. This cost is further aggravated by the lack of energy proportionality in servers. Therefore, in this paper, we study the energy proportionality of servers in the context of a distributed NoSQL data store, namely Apache Cassandra. Towards this goal, we measure the power consumption and performance of a Cassandra cluster. We then use power and resource provisioning techniques to improve the energy proportionality of the cluster and study the feasibility of achieving an energy-proportional data store. Our results show that a hybrid (i.e., power and resource) provisioning technique provides the best power savings — as much as 55%.
منابع مشابه
AutoShard – Declaratively Managing Hot Spot Data Objects in NoSQL Document Stores
NoSQL document stores are becoming increasingly popular as backends in web development. Not only do they scale out to large volumes of data, many systems are even custom-tailored for this domain: NoSQL document stores like Google Cloud Datastore have been designed to support massively parallel reads, and even guarantee strong consistency in updating single data objects. However, strongly consis...
متن کاملNoSQL Databases for RDF: An Empirical Evaluation
Processing large volumes of RDF data requires sophisticated tools. In recent years, much effort was spent on optimizing native RDF stores and on repurposing relational query engines for large-scale RDF processing. Concurrently, a number of new data management systems— regrouped under the NoSQL (for “not only SQL”) umbrella—rapidly rose to prominence and represent today a popular alternative to ...
متن کاملComparative Study of Column Oriented NoSQL Databases on Characteristics
NoSQL database, also called Not Only SQL, is an approach to data management and database design that's useful for very large sets of distributed data. The growing popularity of big data will compel many companies to use NoSQL databases instead of traditional database. Generally, there are three main types of NoSQL databases: key-value stores, column oriented databases and document based stores....
متن کاملSchemaless NoSQL Data Stores - Object-NoSQL Mappers to the Rescue?
NoSQL data stores are becoming increasingly popular in application development. These systems are attractive for developers due to their ability to handle large volumes of data, as well as data with a high degree of structural variety. Typically, NoSQL data stores are accessed programmatically. Due to the imminent lack of standardized query languages, building applications against the native in...
متن کاملDistributed NoSQL Storage for Extreme-Scale System Services
Today with the rapidly accumulated data, datadriven applications are emerging in science and commercial areas. On both HPC systems and clouds the continuously widening performance gap between storage and computing resource prevents us from building scalable data-intensive systems. Distributed NoSQL storage systems are known for their ease of use and attractive performance and are increasingly u...
متن کامل